A (acronyms)

نویسندگان

  • Manuel Zahariev
  • William S. Havens
چکیده

Acronyms are a significant and the most dynamic area of the lexicon of many languages. Building automated acronym systems poses two problems: acquisition and disambiguation. Acronym acquisition is based on the identification of anaphoric or cataphoric expressions which introduce the meaning of an acronym in text; acronym disambiguation is a word sense disambiguation task, with expansions of an acronym being its possible senses. It is proposed here that acronyms are universal phenomena, occurring in all languages with a written form, and that their formation is governed by linguistic preferences, based on regularities at the character, phoneme, word and phrase levels. A universal explanatory theory of acronyms is presented, which rests on a set of testable hypotheses, and is manifested through a set of violable, ordered rules. The theory is developed based on examples from fifteen languages, with six different writing systems. A dynamic programming algorithm is implemented based on the explanatory theory of acronyms. The algorithm is evaluated on lists of acronyms-expansion pairs in Russian Spanish, Danish, German, English, French, Italian, Dutch, Portuguese, Finnish, and Swedish and achieves excellent performance. A two-pass greedy algorithm for automatic acronym acquisition is designed, which results in good performance for specific domains. A hybrid, machine learning algorithm – using features generated through dynamic programming acronym-expansion matching – is proposed and results in good performance on noisy, parsed, newspaper text. A machine learning algorithm for acronym sense disambiguation is presented, which is trained and evaluated automatically on information downloaded following search engine lookup. The algorithm achieves good performance on deciding whether an acronym occurs with a certain sense in a given context, and good accuracy when picking the correct sense for an acronym in a given context. All algorithms presented allow for efficient, readily usable implementations that can be included

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translation of Acronyms, Initialisms and Abbreviations (AIA) in Persian Political and Sport Journalistic Texts

The different writing systems of English and Persian makes translation of acronyms, initialisms and abbreviations challenging. This study aimed at finding which strategies were applied most frequently in translating acronyms, initialisms and abbreviations from English to Persian especially in journalistic texts. The study was done based n Descriptive Translation Study of Toury and strategies pr...

متن کامل

The potentially coercive nature of some clinical research trial acronyms.

STUDY OBJECTIVES To evaluate the potential coerciveness of clinical research trial (CRT) acronyms, and to make clinicians aware that some CRT acronyms may be coercive to research subjects by subliminally enticing or outwardly promising something that the CRT may not be able to deliver. DESIGN Analysis of CRT acronyms for pleasantness and meaningfulness as assessed by studies in the behavioral...

متن کامل

Is there room for the BBC in the mental lexicon? On the recognition of acronyms.

It has been suggested that acronyms like BBC are processed like real words. This claim has been based on improved performance with acronyms in the Reicher-Wheeler task, the letter string matching task, the visual feature integration task, and the N400 component in event-related potential (ERP) studies. Unfortunately, in all these tasks performance on acronyms resembled performance on pseudoword...

متن کامل

Prediction of Word Stress in Swedish Acronyms – or the Difference between a BMX and a BMW

This paper concerns the prediction of stress in Swedish three-letter acronyms. A segmental hypothesis is tested against an “establishedness” hypothesis suggested by Bruce (1993). Subjects were found to rely heavily on segmental cues for selection of stress pattern, especially for familiar acronyms. Establishedness, approximated as familiarity, did not have any impact on stress pattern selection.

متن کامل

SearCh for humourIstic and Extravagant acroNyms and Thoroughly Inappropriate names For Important Clinical trials (SCIENTIFIC): qualitative and quantitative systematic study

OBJECTIVES To describe the development of acronym use across five major medical specialties and to evaluate the technical and aesthetic quality of the acronyms. DESIGN Acronyms obtained through a literature search of Pubmed.gov followed by a standardised assessment of acronym quality (BEAUTY and CHEATING criteria). PARTICIPANTS Randomised controlled trials within psychiatry, rheumatology, p...

متن کامل

Are acronyms really irregular? Preserved acronym reading in a case of semantic dementia.

This paper describes the progressive performance of JD, a patient with semantic dementia, on acronym categorisation, recognition and reading aloud over a period of 18 months. Most acronyms have orthographic and phonological configurations that are different from English words (BBC, DVD, HIV). While some acronyms, the majority, are regularly pronounced letter by letter, others are pronounced in ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004